Cluster analysis and association study of structured multilocus genotype data
نویسندگان
چکیده
منابع مشابه
Discriminatory Association Analysis on Semi-structured Data
Data mining has been applied to the discovery of illegally discriminatory treatments caused by protected-by-law attributes such as race, gender, age, etc. In this paper, we propose an improvement for the previous work of exploring discrimination in semi-structured business data. The main idea is that discrimination represented in the form of association rules is judged by opposite patterns whos...
متن کاملACPA: automated cluster plot analysis of genotype data
Genome-wide association studies have become standard in genetic epidemiology. Analyzing hundreds of thousands of markers simultaneously imposes some challenges for statisticians. One issue is the problem of multiplicity, which has been compared with the search for the needle in a haystack. To reduce the number of false-positive findings, a number of quality filters such as exclusion of single-n...
متن کاملParentage and sibship inference from multilocus genotype data under polygamy.
Likelihood methods have been developed to partition individuals in a sample into sibling clusters using genetic marker data without parental information. Most of these methods assume either both sexes are monogamous to infer full sibships only or only one sex is polygamous to infer full sibships and paternal or maternal (but not both) half sibships. We extend our previous method to the more gen...
متن کاملInference of population structure using multilocus genotype data.
We describe a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations. We assume a model in which there are K populations (where K may be unknown), each of which is characterized by a set of allele frequencies at each locus. Individuals in the sample are assigned (probabilistically) to populations, or jointly to two or...
متن کاملAnalysis of multilocus models of association.
It is increasingly recognized that multiple genetic variants, within the same or different genes, combine to affect liability for many common diseases. Indeed, the variants may interact among themselves and with environmental factors. Thus realistic genetic/statistical models can include an extremely large number of parameters, and it is by no means obvious how to find the variants contributing...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Human Genetics
سال: 2005
ISSN: 1434-5161,1435-232X
DOI: 10.1007/s10038-004-0220-x